Dependent Video Indexing Based on Audio-Visual Interactions

Authors

  • S Tsekeridou
  • I Pitas
Abstract

This paper presents a content-based video indexing method that temporally indexes a video sequence according to the actual speaker. This is achieved by integrating audio and visual information. Audio analysis yields a diagram of speaker identity labels versus time. Visual analysis includes scene cut detection, face shot determination, mouth region extraction and tracking, and finally talking face shot determination. Results from both sources are combined to improve speaker-dependent video indexing. Such a task enables flexible video retrieval or browsing in cases where queries are posed according to speaker identity. Speaker recognition errors are reduced to 2%.
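
The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes per-frame speaker labels from audio analysis, shot boundaries from scene cut detection, and one talking-face flag per shot, and it fuses them with a simple majority vote inside each shot. The function name `index_by_speaker` and all input formats are hypothetical.

```python
from collections import Counter


def index_by_speaker(audio_labels, shot_cuts, talking_face):
    """Assign one speaker label per shot by majority vote (illustrative only).

    audio_labels: per-frame speaker labels from audio analysis (assumed input).
    shot_cuts: sorted frame indices where each shot starts, beginning with 0.
    talking_face: one bool per shot, True if a talking face was detected.
    Returns a list of (start_frame, end_frame_exclusive, speaker_or_None).
    """
    bounds = list(shot_cuts) + [len(audio_labels)]
    index = []
    for i, (start, end) in enumerate(zip(bounds, bounds[1:])):
        votes = Counter(audio_labels[start:end])
        # Only label shots where the visual cue confirms someone is talking.
        if talking_face[i] and votes:
            speaker = votes.most_common(1)[0][0]
        else:
            speaker = None
        index.append((start, end, speaker))
    return index


labels = ["A", "A", "B", "A", "B", "B", "B", "A", "C", "C"]
print(index_by_speaker(labels, [0, 4, 8], [True, True, False]))
```

In this toy input, isolated audio labels that disagree with the rest of a shot are outvoted, which is one plausible way visual shot structure could reduce speaker labeling errors.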

Similar resources

Different Indexing Techniques

This paper describes content-based indexing techniques: audio indexing, video indexing, content-based image indexing, and content-based multimedia indexing. Indexing is concerned with compactly storing a large collection of terms and rapidly retrieving from it a set of candidate terms that satisfy some property. Index is a structure or object in the database Ind...

Content-Based Indexing for Search and Browsing

Storage and archiving of digital video in shared disks and servers in large volumes, browsing of such databases in real time, and retrieval over switched and packet networks pose many new challenges, one of which is efficient and effective description of content. The simplest method to index content is by means of a thesaurus of keywords, which can be assigned manually or semiautomatically to pr...

Audio-based Multimedia Indexing and Retrieval Scheme in Muvis Framework

MUVIS is a PC-based framework that supports indexing, browsing and querying of various multimedia types such as audio, video, and interlaced audio/video in several formats. It allows real-time audio and video capturing and encoding with latest-generation codecs such as MPEG-4, H.263+, MP3 and AAC. MUVIS also supports several audio/video file formats such as AVI, MP4, MP3 and AAC. Almost all image types i...

Audio-visual Content-based Multimedia Indexing and Retrieval – the Muvis Framework

MUVIS is a collaborative framework that supports indexing, browsing and querying of various multimedia types such as audio, video, and interlaced audio/video in several formats. It allows real-time audio and video capturing and encoding with latest-generation codecs such as MPEG-4, H.263+, MP3 and AAC. MUVIS also supports several audio/video file formats such as AVI, MP4, MP3 and AAC. MUVIS achieves a glob...

Gender identification using a general audio classifier

In the context of content-based multimedia indexing, gender identification from the speech signal is an important task. Existing techniques depend on the quality of the speech signal, making them unsuitable for video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the ...

Publication date: 1998